Efficient Earley Parsing with Regular Right-hand Sides

Authors

  • Trevor Jim
  • Yitzhak Mandelbaum
Abstract

We present a new variant of the Earley parsing algorithm capable of efficiently supporting context-free grammars with regular right-hand sides. We present the core state-machine-driven algorithm, the translation of grammars into state machines, and the reconstruction algorithm. We also include a theoretical framework for presenting the algorithm and for evaluating optimizations. Finally, we evaluate the algorithm by testing its implementation.

Similar resources

Practical Earley Parsing

Earley’s parsing algorithm is a general algorithm, able to handle any context-free grammar. As with most parsing algorithms, however, the presence of grammar rules having empty right-hand sides complicates matters. By analyzing why Earley’s algorithm struggles with these grammar rules, we have devised a simple solution to the problem. Our empty-rule solution leads to a new type of finite automa...

A Variant of Earley Parsing (arXiv:cmp-lg/9808017v1, 31 Aug 1998)

The Earley algorithm is a widely used parsing method in natural language processing applications. We introduce a variant of Earley parsing that is based on a “delayed” recognition of constituents. This allows us to start the recognition of a constituent only in cases in which all of its subconstituents have been found within the input string. This is particularly advantageous in several cases i...

Partially Ordered Multiset Context-free Grammars and Free-word-order Parsing

We present a new formalism, partially ordered multiset context-free grammars (pomsCFG), along with an Earley-style parsing algorithm. The formalism, which can be thought of as a generalization of context-free grammars with partially ordered right-hand sides, is of interest in its own right, and also as infrastructure for obtaining tighter complexity bounds for more expressive context-free forma...

Parsing Contextual Grammars with Linear, Regular and Context-Free Selectors

Contextual Grammars (CGs) provide an appropriate description of natural languages. Unfortunately, no parser which runs in polynomial time was known for some linguistically relevant classes. In this paper, an intertwined two-level Earley-based parser for CGs with finite, regular and context-free selectors is presented. In both phases context-free grammars are defined which identify individual se...

Journal:
  • Electr. Notes Theor. Comput. Sci.

Volume: 253

Publication year: 2010